An empirical analysis of word error rate and keyword error rate
نویسندگان
چکیده
This paper studies the relationship between word error rate (WER) and keyword error rate (KER) in speech transcripts and their effect on the performance of speech analytics applications. Automatic speech recognition (ASR) systems are increasingly used as input for speech analytics, which raises the question of whether WER or KER is the more suitable performance metric for calibrating the ASR system. ASR systems are typically evaluated in terms ofWER.Many speech analytics applications, however, rely on identifying keywords in the transcripts—thus their performance can be expected to be more sensitive to keyword errors than regular word errors. To study this question, we conduct a case study using an experimental data set comprising 100 calls to a contact center. We first automatically extract domain-specific words from the manual transcription and use this set of words to calculate keyword error rates in the following experiments. We then generate call transcripts with the IBM Attila speech recognition system, using different training for each repetition to generate transcripts with a range of word error rates. The transcripts are then processed with two speech analytics applications, call section segmentation and topic categorization. The results show similar WER and KER in high-accuracy transcripts, but KER increases more rapidly than WER as the accuracy of the transcription deteriorates. Neither speech analytics application showed significant sensitivity to the increase in KER for low-accuracy transcripts. Thus this case study did not identify a significant difference between using WER and KER.
منابع مشابه
Modified signed log-likelihood test for the coefficient of variation of an inverse Gaussian population
In this paper, we consider the problem of two sided hypothesis testing for the parameter of coefficient of variation of an inverse Gaussian population. An approach used here is the modified signed log-likelihood ratio (MSLR) method which is the modification of traditional signed log-likelihood ratio test. Previous works show that this proposed method has third-order accuracy whereas the traditi...
متن کاملEffect of light color temperature on selective attention, error rate and reaction time
Investigating the effect of light color temperature on selective attention, error and human reaction time Abstract Background and aims: In humans, the reaction time limit is associated with several factors. It includes the time that takes to stimulate the sensory member, the stimulus effect is transmitted to the brain, then is perceived and the decision is made; consequently, the command resu...
متن کاملKeyword-based Discriminative Training of Acoustic Models1
In this paper, we investigate a new discriminative training technique which focuses on optimizing a keyword error rate, rather than the error rate on all words. We hypothesize that improvements in keyword error rate correlate with improvements in understanding error rates. Keyword-based discriminative training is accomplished by modifying a standard minimum classification error (MCE) training a...
متن کاملAn Empirical Analysis of China’s International Reserves Demand Function
The study aims to estimate an international reserves demand model for China using economic growth, propensity to import, real effective exchange rate and trade openness variables for quarterly period spanning from 1985Q1 to 2014Q4.The bounds testing technique to cointegration is used to test for a long run relationship, while the autoregressive distributed lag approach is used to estimate short...
متن کاملEvaluation of the relationship between the uses of safety procedures in the rate of human error in Yazd Combined Cycle Power Plant
Introduction: About 60 to 90 percent of an accident in the industry is caused by human error. This study aimed to assess the effectiveness of safety procedures in reducing human error in Yazd Combined Cycle Power Plant employees. Materials and Methods: The present study is a quasi-experimental intervention that was conducted aimed to measure the human error of 121 employees of Yazd Combined...
متن کامل